Dataset-invariant covariance normalization for out-domain PLDA speaker verification

نویسندگان

Md. Hafizur Rahman

Ahilan Kanagasundaram

David Dean

Sridha Sridharan

چکیده

In this paper we introduce a novel domain-invariant covariance normalization (DICN) technique to relocate both in-domain and out-domain i-vectors into a third dataset-invariant space, providing an improvement for out-domain PLDA speaker verification with a very small number of unlabelled in-domain adaptation i-vectors. By capturing the dataset variance from a global mean using both development out-domain i-vectors and limited unlabelled in-domain i-vectors, we could obtain domaininvariant representations of PLDA training data. The DICNcompensated out-domain PLDA system is shown to perform as well as in-domain PLDA training with as few as 500 unlabelled in-domain i-vectors for NIST-2010 SRE and 2000 unlabelled in-domain i-vectors for NIST-2008 SRE, and considerable relative improvement over both out-domain and in-domain PLDA development if more are available.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification

The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non-target (or outdomain) data significantly affects the speaker verification performance due to the domain mismatch between training and evaluation data. To improve the speaker verification performance, sufficient amount of domain mismatch compensated out-domain data must be used to train the PLDA ...

متن کامل

Domain adaptation based Speaker Recognition on Short Utterances

This paper explores how the inand out-domain probabilistic linear discriminant analysis (PLDA) speaker verification behave when enrolment and verification lengths are reduced. Experiment studies have found that when full-length utterance is used for evaluation, in-domain PLDA approach shows more than 28% improvement in EER and DCF values over out-domain PLDA approach and when short utterances a...

متن کامل

PLDA based speaker recognition on short utterances

This paper investigates the effects of limited speech data in the context of speaker verification using a probabilistic linear discriminant analysis (PLDA) approach. Being able to reduce the length of required speech data is important to the development of automatic speaker verification system in real world applications. When sufficient speech is available, previous research has shown that heav...

متن کامل

Compensating Inter-Dataset Variability in PLDA Hyper-Parameters for Robust Speaker Recognition

Recently we have introduced a method named inter-dataset variability compensation (IDVC) in the context of speaker recognition in a mismatched dataset. IDVC compensates dataset shifts in the i-vector space by constraining the shifts to a low dimensional subspace. The subspace is estimated from a heterogeneous development set which is partitioned into homogenous subsets. In this work we generali...

متن کامل

Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis

I-vector extraction and Probabilistic Linear Discriminant Analysis (PLDA) has become the state-of-the-art configuration for speaker verification. Recently, Gaussian-PLDA has been improved by a preliminary length normalization of i-vectors. This normalization, known to increase the Gaussianity of the i-vector distribution, also improves performance of systems based on standard Linear Discriminan...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2015

Dataset-invariant covariance normalization for out-domain PLDA speaker verification

نویسندگان

چکیده

منابع مشابه

Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification

Domain adaptation based Speaker Recognition on Short Utterances

PLDA based speaker recognition on short utterances

Compensating Inter-Dataset Variability in PLDA Hyper-Parameters for Robust Speaker Recognition

Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis

عنوان ژورنال:

اشتراک گذاری